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Period for Reply 

A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) OR THIRTY (30) DAYS, 
WHICHEVER IS LONGER, FROM THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1.136(a). In no event, however, may a reply be timely filed 
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earned patent term adjustment. See 37 CFR 1.704(b). 

Status 

1 )[3 Responsive to communication(s) filed on 03 December 2003 . 
2a)D This action is FINAL. 2b® This action is non-final. 

3)D Since this application is in condition for allowance except for formal matters, prosecution as to the merits is 
closed in accordance with the practice under Ex parte Quayle, 1935 CD. 1 1 , 453 O.G. 213. 



is/are withdrawn from consideration. 
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4) S Claim(s) 1-31 is/are pending in the application. 
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5) D Claim(s) is/are allowed. 
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DETAILED ACTION 

1. Claims 1-31 are pending. 

Drawings 

2. Figure 1 should be designated by a legend such as -Prior Art- because only 
that which is old is illustrated. See MPEP § 608.02(g). Corrected drawings in 
compliance with 37 CFR 1.121(d) are required in reply to the Office action to avoid 
abandonment of the application. The replacement sheet(s) should be labeled 
"Replacement Sheet" in the page header (as per 37 CFR 1 .84(c)) so as not to obstruct 
any portion of the drawing figures. If the changes are not accepted by the examiner, the 
applicant will be notified and informed of any required corrective action in the next Office 
action. The objection to the drawings will not be held in abeyance. 



Claim Objections 

3. Claim 5 is objected to for the phrase: "wherein each thread uses writes disparate 
flag and status bits", where it appears that the word 'uses' is unnecessary. Appropriate 
correction is required. 

4. Claim 1 1 is objected to for referring to a program counter, which has not been 
introduced in the claim chain. Appropriate correction is required. 
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5. Claim 14 is objected to for stating: "wherein context circuitry is a context register", 
when no context circuitry has been introduced in Claim 1 . Appropriate correction is 
required. 

Claim Rejections - 35 USC § 102 

6. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 
States. 

7. Claims 1-2, 19-20, and 27-28 are rejected under 35 U.S.C. 102(b) as being 
anticipated by Owen et al. (USPN 4,876,660, herein Owen). 

8. As per Claim 1 , Owen teaches: A processor, comprising: 

a plurality of registers (Column 8, Lines 24-26, Accumulators 40 and 42. Also see 
Column 9, Lines 19-20), wherein each of the plurality of registers comprises data bits for 
holding data and one or more flag and status bits for holding carry flag information 
(Column 4, Lines 27-31 and Column 9, Lines 33-34 and 40 disclose the flag and status 
bits, while Column 9, Lines 29-34 disclose that the register is holding the data and the 
status bits/flags); 

an arithmetic logic unit (Figure 6a, Adder 34, also see Column 9, Lines 38-41), 
the arithmetic logic unit coupled to the plurality of registers, wherein the arithmetic logic 
unit is configured to write data and carry flag information to one of the plurality of 
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registers (Column 9, Lines 38-47, and Column 8, Lines 28-30, which states the data can 
be written into either one or both of the registers). 

9. As per Claim 2, Owen teaches: The processor of claim 1 , wherein the one or 
more flag and status bits hold overflow flag information (Column 4, Lines 27-31 , and 
Column 9, Lines 36-37). 

10. As per Claim 19, Owen teaches: A method of processing data, comprising: 
providing data to an arithmetic logic unit (Figure 6a, Adder 34, also see Column 

9, Lines 38-41) associated with a processor, the data obtained from registers (Column 
8, Lines 30-40, which disclose how the registers feedback into the adder) comprising 
data bits for holding data and one or more flag and status bits for holding carry flag 
information (Column 4, Lines 27-31 and Column 9, Lines 33-34 and 40 disclose the flag 
and status bits, while Column 9, Lines 29-34 disclose that registers (Accumulator 40 or 
42) are holding the data and the status bits/flags); 

processing the data at the arithmetic logic unit (Column 9, Lines 38-43), the 
arithmetic logic init coupled to the plurality of registers, wherein the arithmetic logic unit 
is configured to write data and carry flag information to a result register (Column 9, 
Lines 38-47, and Column 8, Lines 28-30, which states the data can be written into either 
one or both of the registers). 
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11. As per Claim 20, Owen teaches: The method of claim 1 9, wherein the one or 
more flag and status bits hold overflow flag information (Column 4, Lines 27-31, and 
Column 9, Lines 36-37). 

12. As per Claim 27, Owen teaches: A programmable chip, comprising: 

means for providing data to an arithmetic logic unit associated with a processor 
on the programmable chip (Column 8, Lines 30-40, which disclose how the registers 
feedback into the adder), the data obtained from registers comprising data bits for 
holding data and one or more flag and status bits for holding carry flag information 
(Column 4, Lines 27-31 and Column 9, Lines 33-34 and 40 disclose the flag and status 
bits, while Column 9, Lines 29-34 disclose that the register is holding the data and the 
status bits/flags); 

processing the data at the arithmetic logic unit (Column 9, Lines 38-43), the 
arithmetic logic unit coupled to the plurality of registers, where the arithmetic logic unit is 
configured to write data and carry flag information to a result register (Column 9, Lines 
38-47, and Column 8, Lines 28-30, which states the data can be written into either one 
or both of the registers). 

13. As per Claim 28, Owen teaches: The programmable chip of claim 27, wherein 
the one or more flag and status bits hold overflow flag information (Column 4, Lines 27- 
31, and Column 9, Lines 36-37). 
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Claim Rejections - 35 USC § 103 

14. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 1 02 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

1 5. Claims 3,21, and 29 are rejected under 35 U.S.C. 1 03(a) as being unpatentable 
over Owen, in view of Baldwin (USPN 5,042,000, herein Baldwin). 

16. As per Claim 3, Owen teaches the processor of claim 2, but fails to explicitly 
teach: wherein the one or more flag bits hold sticky overflow flag information. Owen 
teaches an overflow bit, but does not specify whether it is a "sticky" overflow bit or not. 
Baldwin teaches that a "sticky" status bit may be used in situations where it is preferable 
to monitor fault conditions separately from the primary error-handling mechanism 
(Column 95, Lines 50-64). The advantage of this, as Baldwin discloses, is being able to 
see and operate on the flag after a series of operations, instead of checking at each 
individual operation. Given these advantages, one of ordinary skill in the art at the time 
the invention was made would have implemented Baldwin's "sticky" overflow bit into 
Owens invention if they wanted to bypass the normal error-handling mechanisms for 
certain series of instructions. 

17. As per Claim 21 , Owen teaches the method of claim 20, but fails to explicitly 
teach: wherein the one or more flag bits hold sticky overflow flag information. Owen 
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teaches an overflow bit, but does not specify whether it is a "sticky" overflow bit or not. 
Baldwin teaches that a "sticky" status bit may be used in situations where it is preferable 
to monitor fault conditions separately from the primary error-handling mechanism 
(Column 95, Lines 50-64). The advantage of this, as Baldwin discloses, is being able to 
see and operate on the flag after a series of operations, instead of checking at each 
individual operation. Given these advantages, one of ordinary skill in the art at the time 
the invention was made would have implemented Baldwin's "sticky" overflow bit into 
Owens invention if they wanted to bypass the normal error-handling mechanisms for 
certain series of instructions. 

18. As per Claim 29, Owen teaches the programmable chip of claim 28, but fails to 
explicitly teach: wherein the one or more flag bits hold sticky overflow flag information. 
Owen teaches an overflow bit, but does not specify whether it is a "sticky" overflow bit or 
not. Baldwin teaches that a "sticky" status bit may be used in situations where it is 
preferable to monitor fault conditions separately from the primary error-handling 
mechanism (Column 95, Lines 50-64). The advantage of this, as Baldwin discloses, is 
being able to see and operate on the flag after a series of operations, instead of 
checking at each individual operation. Given these advantages, one of ordinary skill in 
the art at the time the invention was made would have implemented Baldwin's "sticky" 
overflow bit into Owens invention if they wanted to bypass the normal error-handling 
mechanisms for certain series of instructions. 
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19. Claims 4-14,18, 21, 22-26, and 30-31 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over Owen, in view of Parady (USPN 5,933,627). 

20. As per Claim 4, Owen teaches the processor of claim 1 , but fails to explicitly 
teach: wherein the processor is a multithreaded processor. 

Owen teaches a processor, namely a multiplier-accumulator unit and the 
supporting hardware to execute arithmetic operations, but does not teach about the 
overall system it could be implemented on. Parady teaches a multithreaded 
microprocessor (Parady Column 2 Lines 18-20) which interleaves threads on a cycle- 
by-cycle basis in round robin fashion (Column 3, Lines 18-19, and Column 2, Lines 6-8, 
which state that fine-grained processors interleave on a cycle-by-cycle basis, Column 2, 
Lines 32-34, and Column 4, Lines 9-10, the embodiment in which round-robin is used). 
Owen teaches that the advantage of his system is higher density, which leads to fewer 
parts, lower power requirements, and increases system speed due to interconnection 
delay being reduced (Column 3, Lines 32-40). Given these advantages, it would have 
been obvious to one of ordinary skill in the art at the time the invention was made to 
include the multiplier-accumulator circuit of Owens design into the multithreaded system 
of Parady's, to enhance the speed and decrease the power consumption of Parady's 
system. 

21 . As per Claim 5, Parady teaches: The processor of claim 1 , wherein providing flag 
and status information in each of the plurality of registers allows the processor to handle 
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a plurality of threads, wherein each thread uses writes disparate flag and status bits 
(Column 1, Lines 17-20. In a multi-threaded environment, each thread is working on a 
different part of the program, and would therefore be not be writing the same flag and 
status bits). 

22. As per Claim 6, Parady teaches: The processor of claim 5, wherein the plurality 
of threads are each associated with a subset of the plurality of registers (Column 2, 
Lines 35-36). 

23. As per Claim 7, Parady teaches: The processor of claim 6, further comprising 
context circuitry associated with the plurality of threads, wherein the context circuitry is 
operable to track which of the plurality of threads is being processed by a particular 
stage (Column 3, Line 57 - Column 4, Line 30). 

24. As per Claim 8, Parady teaches: The processor of claim 7, wherein the arithmetic 
logic unit is operable to process the plurality of threads, the arithmetic logic unit 
configured to process a particular thread for a single clock cycle before context 
switching to process a next thread, wherein each of the plurality of threads are 
processed for the single clock cycle before context switching (Column 4, Lines 9-10). 

25. As per Claim 9, Parady teaches: The processor of claim 1 , wherein the processor 
is a processor core on a programmable chip (Column 1, Lines 11-12). 
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26. As per Claim 10, Parady teaches: The processor of claim 8, wherein the 
particular thread is processed for a single clock cycle to allow a memory access to 
complete before continuing processing of the particular thread (Column 4, Lines 42-48). 

27. As per Claim 1 1 , Parady teaches: The processor of claim 1 0, where the program 
counters track the processing location of each thread (Column 3, Lines 50-55). 

28. As per Claim 12, Parady teaches: The processor of claim 1 , wherein the plurality 
of registers are configured as memory (Column 2, Lines 25-26) on a programmable chip 
(Column 1, Lines 11-12). 

29. As per Claim 1 3, Parady teaches: The processor of claim 1 , wherein the 
processor is a central processing unit or a digital signal processor (Figures 1 and 2, the 
Ultrasparc processor, where a CPU is defined by the Dictionary of Computers, 
Information Processing & Telecommunications as "a unit of a computer that includes 
circuits controlling the interpretation and execution of instructions). 

30. As per Claim 14, Parady teaches: The processor of claim 1 , wherein context 
circuitry is a context register (Column 4, Lines 9-12). 
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31 . As per Claim 1 8, Parady teaches: The processor of claim 1 0, further comprising 
a plurality of program counters associated with the plurality of threads (Column 2, Lines 
25-32, the program address registers). 

32. As per Claim 22, Owen teaches the method of claim 1 9, but fails to explicitly 
teach: wherein the processor is a multithreaded processor. 

Owen teaches a processor, namely a multiplier-accumulator unit and the 
supporting hardware to execute arithmetic operations, but does not teach about the 
overall system it could be implemented on. Parady teaches a multithreaded 
microprocessor (Parady Column 2 Lines 18-20) which interleaves threads on a cycle- 
by-cycle basis in round robin fashion (Column 3, Lines 18-19, and Column 2, Lines 6-8, 
which state that fine-grained processors interleave on a cycle-by-cycle basis, Column 2, 
Lines 32-34, and Column 4, Lines 9-10, the embodiment in which round-robin is used). 
Owen teaches that the advantage of his system is higher density, which leads to fewer 
parts, lower power requirements, and increases system speed due to interconnection 
delay being reduced (Column 3, Lines 32-40). Given these advantages, it would have 
been obvious to one of ordinary skill in the art at the time the invention was made to 
include the multiplier-accumulator circuit of Owens design into the multithreaded system 
of Parady's, to enhance the speed and decrease the power consumption of Parady's 
system. 
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33. As per Claim 23, Parady teaches: The method of claim 19, where providing flag 
and status information in each of the plurality of registers allows the processor to handle 
a plurality of threads, wherein each thread uses writes disparate flag and status bits 
(Column 1, Lines 17-20. In a multi-threaded environment, each thread is working on a 
different part of the program, and would therefore be not be writing the same flag and 
status bits). 

34. As per Claim 24, Parady teaches: The method of claim 23, wherein the plurality 
of threads are each associated with a subset of the plurality of registers (Column 2, 
Lines 35-36). 

35. As per Claim 25, Parady teaches: The method of claim 24, further comprising 
context circuitry associated with the plurality of threads (Column 2, Lines 25-32, the 
program address registers), wherein the context circuitry is operable to track which of 
the plurality of threads is being processed by a particular stage (Column 3, Line 57 - 
Column 4, Line 30). 

36. As per Claim 26, Parady teaches: The method of claim 25, wherein the arithmetic 
logic unit is operable to process the plurality of threads, the arithmetic logic unit 
configured to process a particular thread for a single clock cycle before context 
switching to process a next thread, wherein each of the plurality of threads are 
processed for the single clock cycle before context switching (Column 4, Lines 42-48). 



Application/Control Number: 10/727,850 
Art Unit: 2183 



Page 13 



37. As per Claim 30, Owen teaches the programmable chip of claim 27, but fails to 
explicitly teach: wherein the processor is a multithreaded processor. 

Owen teaches a processor, namely a multiplier-accumulator unit and the 
supporting hardware to execute arithmetic operations, but does not teach about the 
overall system it could be implemented on. Parady teaches a multithreaded 
microprocessor (Parady Column 2 Lines 18-20) which interleaves threads on a cycle- 
by-cycle basis in round robin fashion (Column 3, Lines 18-19, and Column 2, Lines 6-8, 
which state that fine-grained processors interleave on a cycle-by-cycle basis, Column 2, 
Lines 32-34, and Column 4, Lines 9-10, the embodiment in which round-robin is used). 
Owen teaches that the advantage of his system is higher density, which leads to fewer 
parts, lower power requirements, and increases system speed due to interconnection 
delay being reduced (Column 3, Lines 32-40). Given these advantages, it would have 
been obvious to one of ordinary skill in the art at the time the invention was made to 
include the multiplier-accumulator circuit of Owens design into the multithreaded system 
of Parady's, to enhance the speed and decrease the power consumption of Parady's 
system. 

38. As per Claim 31 , Parady teaches: The programmable chip of claim 27, wherein 
providing flag and status information in each of the plurality of registers allows the 
processor to handle a plurality of threads, wherein each thread uses writes disparate 
flag and status bits (Column 1, Lines 17-20. In a multi-threaded environment, each 
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thread is working on a different part of the program, and would therefore be not be 
writing the same flag and status bits). 

39. Claims 15-17 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Owen and Parady, further in view of Patterson et al. (herein Patterson). 

40. As per Claim 1 5, Parady teaches: The processor of claim 14, where in the 
processor comprises a plurality of stages (Figure 1), each stage having an associated 
context register. 

Patterson teaches that in a pipelined processor, pipeline registers must be used 
to save the data (or context) of each stage, to be able to be passed on to the next stage 
in the pipeline (A-9). This is an essential part of a pipelined processor to ensure that 
different stages of the pipeline do not interfere with each other (A-8). Therefore, it would 
have been obvious to one of ordinary skill in the art at the time the invention was made 
to include registers in each pipeline stage to maintain the context of the stage for the 
next stage. 

41 . As per Claim 16, Parady teaches: The processor of claim 15, wherein each 
thread is associated with a program counter (Column 2, Lines 25-32, the program 
address registers). 
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42. As per Claim 17, Parady teaches: The processor of claim 1 7, wherein the 
plurality of stages comprise reading the context register (Column 4, Lines 9-12, a thread 
has to be selected to run each clock cycle, requiring the register to be read to select 
one), accessing the program counter (Column 3, Lines 50-56), obtaining an instruction 
(Figure 1, Prefetch unit 16, also see Column 3, Lines 2-7), decoding the instruction 
(Figure 1, Decode unit 14), executing the operation (Column 3, Lines 18-19, the ALUs 
comprise an execution unit seen in Figures 1 and 3), and writing a result to one of the 
plurality of registers (Figures 1 and 3, where the output of the ALU (execution unit) is 
directly connected to the register file). 



Conclusion 

43. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure as follows. Applicant is reminded that in amending in response to 
a rejection of claims, the patentable novelty must be clearly shown in view of the state 
of the art disclosed by the references cited and the objections made. Applicant must 
also show how the amendments avoid such references and objections. See 37 CFR § 
1.111(c). 

44. Typaldos et al. (USPN 5,958,024) teaches a system where data registers also 
hold status flag information. 
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45. Cantwell (USPN 5,1 81 ,226) teaches a system where a carry bit is attached to a 
data register during a computation. 

46. Wennberg et al. teaches the use of sticky overflow bits in a counter. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Robert E. Fennema whose telephone number is (571) 
272-2748. The examiner can normally be reached on Monday-Friday, 8:00-5:30. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Eddie Chan can be reached on (571) 272-4162. The fax phone number for 
the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 



Robert E Fennema 

Examiner 
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